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Mesorhizobium opportunistum strain WSM2075 1 was isolated in Western Australia in 2000 
from root nodules of the pasture legume Biserrula pelecinus that had been inoculated with M. 
ciceribv. biserrulae WSM1271. WSM2075 1 is an aerobic, motile, Gram negative, non-spore- 
forming rod that has gained the ability to nodulate B. pelecinus but is completely ineffective in 
N 2 fixation with this host. This report reveals that the genome of M. opportunistum strain 
WSM2075 1 contains a chromosome of size 6,884,444 bp, encoding 6,685 protein-coding 
genes and 62 RNA-only encoding genes. The genome contains no plasmids, but does harbor a 
455.7 kb genomic island from Mesorhizobium ciceribv. biserrulae WSM12 71 that has been in- 
tegrated into a phenylalanine-tRNAgene. 



Introduction 

Biserrula pelecinus L. is an autogamous annual leg- 
ume species that is common, though never domi- 
nant, on coarse textured and acidic Mediterranean 
soils [1] and can often be found with other annual 
legumes including subterranean clover {Trifolium 
subterraneum) and serradella [Ornithopus] [2]. This 
reseeding legume was introduced to Western Aus- 
tralia in 1993 in a pasture legume breeding and 
selection program that sought to develop new pas- 
ture legume options for the sandy surfaced duplex, 
acidic soils in Western Australia, to improve soil 
fertility and farming system flexibility [1]. At the 
time of introduction, the Australian resident 
rhizobial populations were not capable of 
nodulating B. pelecinus [1,3] and a Mediterranean 
strain Mesorhizobium ciceri bv. biserrulae 
WSM1271 had to be used as an inoculant to estab- 
lish an effective nitrogen fixing symbiosis. After 6 
years of cultivation of B. pelecinus under field con- 



ditions, an isolate (designated WSM2075) was re- 
covered from root nodules of plants grown near 
Northam, Western Australia that displayed an inef- 
fective symbiotic phenotype [4]. Accumulated evi- 
dence revealed that WSM2075 had gained the abil- 
ity to nodulate (but not fix with) B. pelecinus by ac- 
quiring symbiotic genes from the original inoculant 
strain Mesorhizobium ciceri bv. biserrulae 
WSM1271 following a lateral gene transfer event 
[5]. Strain WSM2075 has now been designated as 
strain WSM2075T (= LMG 24607 = HAMBI 3007) 
and is the type strain for a new species described as 
Mesorhizobium opportunistum [6]. The species 
name op.por.tu.nis'tum. L. neut. adj. opportunistum 
reflects the opportunistic behavior of the organism 
to nodulate a range of legume hosts by acquiring 
symbiotic genes [4,5]. M. opportunistum WSM2075 T 
is competitive for nodulation of B. pelecinus but 
cannot fix nitrogen [4] and the finding of such 
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strains that have rapidly evolved in the soil pre- 
sents a threat to the successful establishment of 
this valuable pasture species in Australia [5]. 

Here we present a summary classification and a 
set of general features for M. opportunistum strain 
WSM2075 T together with the description of the 
complete genome sequence and annotation. Here 
we reveal that a 455.7 kb genomic island from the 
inoculant Mesorhizobium ciceri bv. biserrulae 
WSM1271 has been horizontally transferred into 
M. opportunistum strain WSM2075 T and integrat- 
ed into the phenylalanine-tRNA gene. 

Classification and general features 

M. opportunistum strain WSM2075 T is a motile, 
Gram-negative, non-spore-forming rod (Figure 1A 
and Figure IB in the order Rhizobiales of the class 
Alphaproteobacteria. They are moderately fast 
growing forming 2-4 mm diameter colonies with- 
in 3-4 days and have a mean generation time of 4- 
6 h when grown in half Lupin Agar [VzLA] broth 
[7] at 28°C. Colonies on VzLA are white-opaque, 
slightly domed, moderately mucoid with smooth 
margins (Figure 1C). 

Strains of this organism are able to tolerate a pH 
range between 5.5 and 9.0. Carbon source utiliza- 
tion and fatty acid profiles have been described 
previously [6]. Minimum Information about the 
Genome Sequence (MIGS) is provided in Table 1. 

Figure 2 shows the phylogenetic neighborhood of 
Mesorhizobium opportunistum strain WSM2075 T 
in a 16S rRNA sequence based tree. This strain 
clusters in a tight group which included M. 
amorphae, M. huakuii, M. plurifarium and M. 
septentrionale and has >99% sequence identity 
with all four type strains. However, based on a 
polyphasic taxonomic study we have identified 
that this strain belongs to a new species [6]. 

Symbiotaxonomy 

M. opportumistum strain WSM2075 T forms an in- 
effective (non-N fixing) symbiosis with its original 
host of isolation, B. pelecinus L., as well as with 
Astragalus adsurgens, A. membranaceus, Lotus 
peregrinus and Macroptilium atropurpureum [4,6]. 
In all cases the root nodules formed are small, 
white and seem incapable of fixing nitrogen [6]. 
Strain WSM2075 T has a broader host range for 
nodulation than Mesorhizobium ciceri bv. 
biserrulae WSM1271 [6]. 




Figure 1A. Image of Mesorhizobium opportunistum strain 
WSM2075 1 using scanning electron microscopy. 




Figure 1 B. Image of Mesorhizobium opportunistum strain 
WSM2075' using transmission electron microscopy. 




Figure 1C. Image of Mesorhizobium opportunistum 
strain WSM2 075 colony morphology on a solid medi- 
um. 
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Table 1 . Classification and general features of Mesorhizobium opportunistum strain WSM2075 1 according to the 
MIGS recommendations [8,9]. 



MIGS ID Property 



Term 



Evidence code 



MIGS-22 



MIGS-6 

MIGS-15 

MIGS-14 



MIGS-4 

MIGS-5 

MIGS-4.1 

MIGS-4.2 

MIGS-4.3 

MIGS-4.4 



Current classification 



Gram stain 

Cell shape 

Motility 

Sporulation 

Temperature range 

Optimum temperature 

Salinity 

Oxygen requirement 
Carbon source 
Energy source 
Habitat 

Biotic relationship 
Pathogenicity 
Biosafety level 
Isolation 

Geographic location 

Nodule collection date 

Longitude 

Latitude 

Depth 

Altitude 



Domain Bacteria TAS [9] 

Phylum Proteobacteria TAS [10] 

Class Alphaproteobacteria TAS [11,12] 

Order Rhizobiales TAS [12,13] 

Family Phyllobacteriaceae TAS [12,14] 

Genus Mesorhizobium TAS [15] 

Species Mesorhizobium opportunistum TAS [6] 

Negative TAS [6] 

Rod TAS [6] 

Motile TAS [6] 

Non-sporulating TAS [16] 

Mesophile TAS [16] 

2 8°C TAS [6] 

Unknown NAS 

Aerobic TAS [16] 
Arabinose, pgentibiose, glucose, mannitol & melibiose TAS [6] 

Chemoorganotroph TAS [16] 

Soil, root nodule, host TAS [6] 

Free living, Symbiotic TAS [6] 

None NAS 

1 TAS [17] 

Root nodule of Biserrula pelecinus L. TAS [6,18] 

Northam, Western Australia TAS [6,18] 

August 2 000 TAS [4] 

116.947875 TAS [4] 

-31.530408 TAS [4] 

10 cm NAS 

160 m NAS 



Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non- 
traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally 
accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology 
project [19]. 
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63 
59 
32 



86 1 Mesorhizobium loti R7A (Gi08825) 

Mesorhizobium loti MAFF303099 (Gc00040)* 

■Mesorhizobium opportnnistnm WSM2075 T |NR 074209. Gc01853| 

Mesorhizobium loti NZP2037 (Gi08826) 
-Mesorhizobium huakuii LMG14107 T (D13431) 
-Mesorhizobium plurifarium LMG 11892 T (Y14158) 
Mesorhizobium australicum WSM2073 1 (CP003358, Gc02468) 
-Mesorhizobium seplenlrionale CCBAU 11014 T (AF508207) 
Mesorhizobium amorphae ACCC 19665 T (AF041442) 
Mesorhizobium mediterraneum LMG 17148 T (AM181745) 
Mesorhizobium temperatum CCBAU 11018 T (AF508208) 
-Mesorhizobium robiniae CCNWYC 115 T (EU849582) 
Mesorhizobium caraganae CCBAU 11299 T (EF1 49003) 
Mesorhizobium larimense CCBAU 83306 T (EF035058) 
Mesorhizobium melallidurans STM 2B83 T (AM930381) 
Mesorhizobium gobiense CCBAU 83330 T (EF035064) 
Mesorhizobium tianshanense USDA3592 1 " (AF041447) 
— Mesorhizobium chacoense Pr5 T (AJ 278249) 

100 r Mesorhizobium alhagi CCNWXJ12-2 T (EU169578) 

I— Mesorhizobium camelthorni CCNVWJ40-4 T (EU1 69581) 
92 I Mesorhizobium ciceribv. biserrutae WSM1271 (Gc01578) 



99 
76 



72 



'Mesorhizobium ciceri NBRC 100389 7 (AB681164) 
Mesorhizobium toh LMG 6125 T (X67229, Gi08881) 
Mesorhizobium shangrilense CCBAU 65327 T (EU074203) 
Mesorhizobium loti CJ3sym (Gi08828) 
Mesorhizobium loti R88B (Gi08827) 
Mesorhizobium albziae CCBAU 61158 T (DQ100066) 
63 r Ensifer fredii LMG 6217 T (X67231) 

~l Ensifer meliloti LMG 6133 T (X67222) 

Ensifer terangae LMG 7834 T (X68388) 

Ensifer kostiensis LMG 19920 T (AM181748) 

Rhizobium galegae LMG B214 T (X6722B, Gi09589 ) 

-Rhizobium mongolense USDA 1844T (Gi08900 U89817) 

■Rhizobium tropici CIAT899 1 (EU488752, GD5744)* 



96 1 Rhizobium leguminosarum bv. Wc/ae USDA 2370 T (U29386, GiOB483) 

-Azorhizobium caulinodans ORS571 T (D 11 342) 

-Bradyrhizobium liaoningense LMG18230 T (AJ250813) 



52 | £ 
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1 Bradyrhizobium yuanmingense LMG 21827 T (AF19381 8) 
^Bradyrhizobium japonicum USDAB T (Gc02045, U69638) 
-Bradyrhizobium canariense LMG 22265 T (AJ 558025) 
-Bradyrhizobium elkanii USDA 76 T (AB509378, Gi08850) 

100 | Methylobacterium nodulans ORS 2060 T (AF2207B3, Gc00935)* 

Methylobacterium sp. WSM2598 (Gi08887) 

Microvirga zambiensis WSM3B93 T (HM362433) 

r- Microvirga lupini Lut6 T (EF191408, Gi06478) 

Microvirga lotononidis WSM3557 T (HM362432, Gi06493) 



Figure 2. Phylogenetic tree showing the relationships of Mesorhizobium opportunistum 
WSM2075 with other root nodule bacteria in the order Rhizobiales based on aligned sequences 
of the 16S rRNA gene (1,290 bp internal region). All positions containing gaps and missing data 
were eliminated. Phylogenetic analyses were performed using MEGA, version 3.1 [20]. The tree 
was built using the Maximum-Likelihood method with the General Time Reversible model and 
bootstrap analysis [21] with 500 replicates to construct a consensus tree. Type strains are indi- 
cated with a superscript T. Brackets after the strain name contain a DNA database accession 
number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in 
GOLD [22]. Published genomes are indicated with an asterisk. 
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Genome sequencing and annotation 

Genome project history 

This organism was selected for sequencing on the 
basis of its environmental and agricultural rele- 
vance to issues in global carbon cycling, alterna- 
tive energy production, and biogeochemical im- 
portance, and is part of the Community Sequenc- 
ing Program at the U.S. Department of Energy, 
Joint Genome Institute (JGI) for projects of 



relevance to agency missions. The genome project 
is deposited in the Genomes OnLine Database [22] 
and the complete genome sequence in GenBank. 
Sequencing, finishing and annotation were per- 
formed by the JGI. A summary of the project in- 
formation is shown in Table 2. 



Table 2. Genome sequencing project information for Mesorhizobium opportunistum WSM2075 T . 
MIGS ID Property Term 



MIGS-31 Finishing quality 
MIGS-28 Libraries used 

MIGS-29 Sequencing platforms 
MIGS-31. 2 Sequencing coverage 
MIGS-30 Assemblers 
MIGS-32 Gene calling method 
Genbank ID 

Genbank Date of Release 
GOLD ID 
NCBI project ID 
Database: IMG 
Project relevance 



Finished 

lllumina GAii shotgun library, 454 Titanium standard library and paired 

end 454 libraries 

lllumina and 454 technologies 

454 std (63. 8x), 454 paired end (91. 5x) and lllumina (1x), total 146.9x 

Velvet, Newbler, phred/Phrap/Consed 

Prodigal, GenePRIMP 

CP0022 79 

January 2 1, 201 1 

Gc01853 

33861 

2503198000 

Symbiotic nitrogen fixation, agriculture 



Growth conditions and DNA isolation 

M. opportunistum strain WSM2075 T was grown to 
mid logarithmic phase in TY rich medium [23] on 
a gyratory shaker at 28°C. DNA was isolated from 
60 mL of cells using a CTAB (Cetyl trimethyl am- 
monium bromide) bacterial genomic DNA isola- 
tion method [24]. 

Genome sequencing and assembly 

The genome of Mesorhizobium opportunistum 
WSM2075 T was sequenced at the Joint Genome In- 
stitute (JGI) using a combination of lllumina [25] 
and 454 technologies [26]. An lllumina GAii shot- 
gun library comprising 370 Mb in reads of 36 ba- 
ses, a 454 Titanium library with read length of 480- 
495 bases containing approximately 1.05 million 
reads, and a paired end 454 library containing 
63840 reads with average insert size of 39 Kb were 
generated for this genome. All general aspects of 
library construction and sequencing performed at 
the JGI can be found at [24]. lllumina sequencing 
data was assembled with VELVET [27], and the 
consensus sequences were shredded into 1.5 Kb 
overlapped fake reads and assembled together 
with the 454 data. Draft assemblies were based on 
375 Mb 454 standard data, and all of the 454 



paired end data. Newbler parameters used were '- 
consed -a 50 -1 350 -g -mi 96 -ml 96'. The initial 
Newbler assembly contained 44 contigs in 1 scaf- 
fold. We converted the initial 454 assembly into a 
phrap assembly by making fake reads from the 
consensus, collecting the read pairs in the 454 
paired end library. The Phred/Phrap/Consed soft- 
ware package was used for sequence assembly and 
quality assessment [28-30] in the subsequent fin- 
ishing process. lllumina data was used to correct 
potential base errors and increase consensus quali- 
ty using software developed at JGI (Polisher, Alia 
Lapidus, unpublished). After the shotgun stage, 
reads were assembled with parallel phrap (High 
Performance Software, LLC). Gaps were closed in 
siliao using software developed at JGI 
(gapResolution, unpublished), and mis-assemblies 
were corrected using Dupfinisher [31], or sequenc- 
ing cloned bridging PCR fragments. Remaining gaps 
between contigs were manually closed by editing in 
Consed, by PCR, and by Bubble PCR primer walks. A 
total of 464 additional reactions and 3 shatter li- 
braries were necessary to close all gaps and to im- 
prove the quality of the finished sequence. 
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Genome annotation 

Genes were identified using Prodigal [32] as part 
of the Oak Ridge National Laboratory genome an- 
notation pipeline, followed by a round of manual 
curation using the JGI GenePrimp pipeline [33]. 
The predicted CDSs were translated and used to 
search the National Center for Biotechnology In- 
formation (NCBI) nonredundant database, 
UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and 
InterPro databases. These data sources were 
combined to assert a product description for each 
predicted protein. Non-coding genes and miscel- 
laneous features were predicted using tRNAscan- 
SE [34], RNAMMer [35], Rfam [36], TMHMM [37], 
and SignalP [38]. Additional gene prediction anal- 
yses and functional annotation were performed 



within the Integrated Microbial Genomes (IMG- 
ER) platform [39]. 

Genome properties 

The genome is 6,884,444 nucleotides with 62.87% 
GC content (Table 3) and comprised of a single 
chromosome and no plasmids. From a total of 
6,747 genes, 6,685 were protein encoding and 62 
RNA only encoding genes. Within the genome, 177 
pseudogenes were also identified. The majority of 
genes (71.11%) were assigned a putative function 
while the remaining genes were annotated as hy- 
pothetical. The distribution of genes into COGs 
functional categories is presented in Table 4 and 
Figure 3. 



Table 3. Genome Statistics for Mesorhizobium opportunistum WSM2075 T . 
Attribute Value % of Total 



Genome size (bp) 

DNA coding region (bp) 

DNA G+C content (bp) 

Number of replicons 

Extrachromosomal elements 

Total genes 

RNA genes 

Protein-coding genes 

Genes with function prediction 

Genes assigned to COGs 

Genes assigned Pfam domains 

Genes with signal peptides 

Genes with transmembrane helices 



6,884,444 



5,948,42 7 



4,328,075 



6,747 
62 
6,685 
4,798 
5,353 
5,595 
610 
1,573 



100.00 



86.40 



62.87 



100.00 
0.92 
99.08 
71.11 
79.34 
82.93 
9.04 
23.31 
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Figure 3. Graphical circular map of the chromosome of Mesorhizobium opportunistum WSM2075 1 . From outside 
to the center: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on re- 
verse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC 
skew. 
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Table 4. Number of protein coding genes of Mesorhizobium opportunistum WSM2075 1 
associated with the general COG functional categories. 
Code Value %age Description 



J 


194 


3.26 


Translation, ribosomal structure and biogenesis 


A 


0 


0 


RNA processing and modification 


K 


515 


8.65 


Transcription 


L 


185 


3.11 


Replication, recombination and repair 


B 


5 


0.08 


Chromatin structure and dynamics 


D 


37 


0.62 


Cell cycle control, mitosis and meiosis 


Y 


0 


0 


Nuclear structure 


V 


63 


1.06 


Defense mechanisms 


T 


227 


3.81 


Signal transduction mechanisms 


M 


315 


5.29 


Cell wall/membrane biogenesis 


N 


50 


0.84 


Cell motility 


Z 


1 


0.02 


Cytoskeleton 


w 


1 


0.02 


Extracellular structures 


u 


131 


2.2 


I ntracel lular traff icki ng and secretion 


o 


208 


3.5 


Posttranslational modification, protein turnover, chaperones 


c 


353 


5.93 


Energy production conversion 


G 


592 


9.95 


Carbohydrate transport and metabolism 


E 


710 


11.93 


Amino acid transport metabolism 


F 


93 


1.56 


Nucleotide transport and metabolism 


H 


217 


3.65 


Coenzyme transport and metabolism 


1 


242 


4.07 


Lipid transport and metabolism 


P 


250 


4.2 


Inorganic ion transport and metabolism 


Q 


191 


3.21 


Secondary metabolite biosynthesis, transport and catabolism 


R 


777 


13.06 


General function prediction only 


S 


594 


9.98 


Function unknown 




1394 


20.66 


Not in COGS 


Total 


5,951 
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